Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 100968 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 13793 |
| Duplicate rows (%) | 13.7% |
| Total size in memory | 15.4 MiB |
| Average record size in memory | 160.0 B |
Variable types
| NUM | 15 |
|---|---|
| CAT | 5 |
Reproduction
| Analysis started | 2020-08-25 01:22:09.087976 |
|---|---|
| Analysis finished | 2020-08-25 01:22:57.242711 |
| Duration | 48.15 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
| Dataset has 13793 (13.7%) duplicate rows | Duplicates |
DRUG_TEST_TYPE_(3_of_3) is highly correlated with DRUG_TEST_RESULTS_(3_of_3) | High correlation |
DRUG_TEST_RESULTS_(3_of_3) is highly correlated with DRUG_TEST_TYPE_(3_of_3) | High correlation |
DRUG_TEST_RESULTS_(3_of_3) has 90394 (89.5%) zeros | Zeros |
EJECTION_PATH has 87477 (86.6%) zeros | Zeros |
DRUG_TEST_TYPE_(3_of_3) has 1068 (1.1%) zeros | Zeros |
CASE_STATE has 2271 (2.2%) zeros | Zeros |
DRUG_TEST_TYPE has 16153 (16.0%) zeros | Zeros |
RELATED_FACTOR_(3)-PERSON_LEVEL
Real number (ℝ≥0)
| Distinct count | 33 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.00750732905475 |
|---|---|
| Minimum | 0 |
| Maximum | 32 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 19 |
| median | 19 |
| Q3 | 19 |
| 95-th percentile | 19 |
| Maximum | 32 |
| Range | 32 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8078197301 |
|---|---|
| Coefficient of variation (CV) | 0.04250003517 |
| Kurtosis | 201.9375804 |
| Mean | 19.00750733 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9361049922 |
| Sum | 1919150 |
| Variance | 0.6525727163 |
| Value | Count | Frequency (%) | |
| 19 | 100362 | 99.4% | |
| 30 | 265 | 0.3% | |
| 8 | 128 | 0.1% | |
| 7 | 34 | < 0.1% | |
| 20 | 30 | < 0.1% | |
| 12 | 24 | < 0.1% | |
| 14 | 22 | < 0.1% | |
| 31 | 20 | < 0.1% | |
| 11 | 11 | < 0.1% | |
| 29 | 7 | < 0.1% | |
| 22 | 6 | < 0.1% | |
| 3 | 5 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 24 | 5 | < 0.1% | |
| 15 | 4 | < 0.1% | |
| 26 | 4 | < 0.1% | |
| 2 | 4 | < 0.1% | |
| 18 | 3 | < 0.1% | |
| 21 | 3 | < 0.1% | |
| 25 | 3 | < 0.1% | |
| 9 | 3 | < 0.1% | |
| 1 | 3 | < 0.1% | |
| 0 | 3 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| 17 | 2 | < 0.1% | |
| Other values (8) | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3 | < 0.1% | |
| 1 | 3 | < 0.1% | |
| 2 | 4 | < 0.1% | |
| 3 | 5 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 7 | 34 | < 0.1% | |
| 8 | 128 | 0.1% | |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 32 | 1 | < 0.1% | |
| 31 | 20 | < 0.1% | |
| 30 | 265 | 0.3% | |
| 29 | 7 | < 0.1% | |
| 28 | 1 | < 0.1% | |
| 27 | 1 | < 0.1% | |
| 26 | 4 | < 0.1% | |
| 25 | 3 | < 0.1% | |
| 24 | 5 | < 0.1% | |
| 23 | 2 | < 0.1% |
METHOD_OF_DRUG_DETERMINATION
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.980082798510419 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 621 |
| Zeros (%) | 0.6% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3367975239 |
|---|---|
| Coefficient of variation (CV) | 0.1130161632 |
| Kurtosis | 40.28719788 |
| Mean | 2.980082799 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.389624933 |
| Sum | 300893 |
| Variance | 0.1134325721 |
| Value | Count | Frequency (%) | |
| 3 | 94794 | 93.9% | |
| 4 | 2761 | 2.7% | |
| 2 | 2675 | 2.6% | |
| 0 | 621 | 0.6% | |
| 1 | 117 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 621 | 0.6% | |
| 1 | 117 | 0.1% | |
| 2 | 2675 | 2.6% | |
| 3 | 94794 | 93.9% | |
| 4 | 2761 | 2.7% |
| Value | Count | Frequency (%) | |
| 4 | 2761 | 2.7% | |
| 3 | 94794 | 93.9% | |
| 2 | 2675 | 2.6% | |
| 1 | 117 | 0.1% | |
| 0 | 621 | 0.6% |
METHOD_ALCOHOL_DETERMINATION
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0442120275730926 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 359 |
| Zeros (%) | 0.4% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5662034135 |
|---|---|
| Coefficient of variation (CV) | 0.2769788094 |
| Kurtosis | 17.9583979 |
| Mean | 2.044212028 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.689822369 |
| Sum | 206400 |
| Variance | 0.3205863055 |
| Value | Count | Frequency (%) | |
| 2 | 84208 | 83.4% | |
| 1 | 7335 | 7.3% | |
| 3 | 7082 | 7.0% | |
| 4 | 1238 | 1.2% | |
| 6 | 721 | 0.7% | |
| 0 | 359 | 0.4% | |
| 5 | 25 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 359 | 0.4% | |
| 1 | 7335 | 7.3% | |
| 2 | 84208 | 83.4% | |
| 3 | 7082 | 7.0% | |
| 4 | 1238 | 1.2% | |
| 5 | 25 | < 0.1% | |
| 6 | 721 | 0.7% |
| Value | Count | Frequency (%) | |
| 6 | 721 | 0.7% | |
| 5 | 25 | < 0.1% | |
| 4 | 1238 | 1.2% | |
| 3 | 7082 | 7.0% | |
| 2 | 84208 | 83.4% | |
| 1 | 7335 | 7.3% | |
| 0 | 359 | 0.4% |
| Distinct count | 59 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.44155574043262 |
|---|---|
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 90394 |
| Zeros (%) | 89.5% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 292.1212775 |
|---|---|
| Coefficient of variation (CV) | 3.060734658 |
| Kurtosis | 5.608476994 |
| Mean | 95.44155574 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.753366631 |
| Sum | 9636543 |
| Variance | 85334.84076 |
| Value | Count | Frequency (%) | |
| 0 | 90394 | 89.5% | |
| 999 | 9248 | 9.2% | |
| 1 | 703 | 0.7% | |
| 996 | 189 | 0.2% | |
| 695 | 45 | < 0.1% | |
| 606 | 39 | < 0.1% | |
| 417 | 35 | < 0.1% | |
| 351 | 28 | < 0.1% | |
| 407 | 25 | < 0.1% | |
| 410 | 24 | < 0.1% | |
| 321 | 21 | < 0.1% | |
| 603 | 19 | < 0.1% | |
| 401 | 18 | < 0.1% | |
| 998 | 18 | < 0.1% | |
| 402 | 14 | < 0.1% | |
| 600 | 12 | < 0.1% | |
| 605 | 11 | < 0.1% | |
| 997 | 10 | < 0.1% | |
| 343 | 10 | < 0.1% | |
| 187 | 8 | < 0.1% | |
| 376 | 8 | < 0.1% | |
| 177 | 8 | < 0.1% | |
| 155 | 7 | < 0.1% | |
| 304 | 6 | < 0.1% | |
| 513 | 5 | < 0.1% | |
| Other values (34) | 63 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 90394 | 89.5% | |
| 1 | 703 | 0.7% | |
| 100 | 1 | < 0.1% | |
| 136 | 2 | < 0.1% | |
| 145 | 1 | < 0.1% | |
| 155 | 7 | < 0.1% | |
| 156 | 1 | < 0.1% | |
| 157 | 1 | < 0.1% | |
| 165 | 3 | < 0.1% | |
| 167 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 999 | 9248 | 9.2% | |
| 998 | 18 | < 0.1% | |
| 997 | 10 | < 0.1% | |
| 996 | 189 | 0.2% | |
| 924 | 1 | < 0.1% | |
| 795 | 1 | < 0.1% | |
| 702 | 4 | < 0.1% | |
| 695 | 45 | < 0.1% | |
| 606 | 39 | < 0.1% | |
| 605 | 11 | < 0.1% |
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0845713493384042 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 87477 |
| Zeros (%) | 86.6% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.843937008 |
|---|---|
| Coefficient of variation (CV) | 2.622176042 |
| Kurtosis | 3.406828131 |
| Mean | 1.084571349 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.30298195 |
| Sum | 109507 |
| Variance | 8.087977708 |
| Value | Count | Frequency (%) | |
| 0 | 87477 | 86.6% | |
| 9 | 10007 | 9.9% | |
| 7 | 1522 | 1.5% | |
| 6 | 701 | 0.7% | |
| 1 | 407 | 0.4% | |
| 3 | 327 | 0.3% | |
| 8 | 258 | 0.3% | |
| 5 | 168 | 0.2% | |
| 2 | 56 | 0.1% | |
| 4 | 45 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 87477 | 86.6% | |
| 1 | 407 | 0.4% | |
| 2 | 56 | 0.1% | |
| 3 | 327 | 0.3% | |
| 4 | 45 | < 0.1% | |
| 5 | 168 | 0.2% | |
| 6 | 701 | 0.7% | |
| 7 | 1522 | 1.5% | |
| 8 | 258 | 0.3% | |
| 9 | 10007 | 9.9% |
| Value | Count | Frequency (%) | |
| 9 | 10007 | 9.9% | |
| 8 | 258 | 0.3% | |
| 7 | 1522 | 1.5% | |
| 6 | 701 | 0.7% | |
| 5 | 168 | 0.2% | |
| 4 | 45 | < 0.1% | |
| 3 | 327 | 0.3% | |
| 2 | 56 | 0.1% | |
| 1 | 407 | 0.4% | |
| 0 | 87477 | 86.6% |
EXTRICATION
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 788.9 KiB |
| 1 | |
|---|---|
| 0 | 9826 |
| 2 | 1209 |
| Value | Count | Frequency (%) | |
| 1 | 89933 | 89.1% | |
| 0 | 9826 | 9.7% | |
| 2 | 1209 | 1.2% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 89933 | 89.1% | |
| 0 | 9826 | 9.7% | |
| 2 | 1209 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 100968 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 89933 | 89.1% | |
| 0 | 9826 | 9.7% | |
| 2 | 1209 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 100968 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 89933 | 89.1% | |
| 0 | 9826 | 9.7% | |
| 2 | 1209 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 100968 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 89933 | 89.1% | |
| 0 | 9826 | 9.7% | |
| 2 | 1209 | 1.2% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.258101576737184 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 1068 |
| Zeros (%) | 1.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.9057076606 |
|---|---|
| Coefficient of variation (CV) | 0.4010925239 |
| Kurtosis | 5.450893448 |
| Mean | 2.258101577 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.437227105 |
| Sum | 227996 |
| Variance | 0.8203063665 |
| Value | Count | Frequency (%) | |
| 2 | 90394 | 89.5% | |
| 5 | 9248 | 9.2% | |
| 0 | 1068 | 1.1% | |
| 6 | 117 | 0.1% | |
| 1 | 84 | 0.1% | |
| 3 | 46 | < 0.1% | |
| 4 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1068 | 1.1% | |
| 1 | 84 | 0.1% | |
| 2 | 90394 | 89.5% | |
| 3 | 46 | < 0.1% | |
| 4 | 11 | < 0.1% | |
| 5 | 9248 | 9.2% | |
| 6 | 117 | 0.1% |
| Value | Count | Frequency (%) | |
| 6 | 117 | 0.1% | |
| 5 | 9248 | 9.2% | |
| 4 | 11 | < 0.1% | |
| 3 | 46 | < 0.1% | |
| 2 | 90394 | 89.5% | |
| 1 | 84 | 0.1% | |
| 0 | 1068 | 1.1% |
| Distinct count | 51 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.4259765470248 |
|---|---|
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 2271 |
| Zeros (%) | 2.2% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 22 |
| Q3 | 38 |
| 95-th percentile | 46 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 15.22821164 |
|---|---|
| Coefficient of variation (CV) | 0.6500566415 |
| Kurtosis | -1.416085416 |
| Mean | 23.42597655 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.07160162336 |
| Sum | 2365274 |
| Variance | 231.8984297 |
| Value | Count | Frequency (%) | |
| 4 | 10151 | 10.1% | |
| 43 | 9231 | 9.1% | |
| 9 | 7738 | 7.7% | |
| 10 | 4030 | 4.0% | |
| 32 | 3765 | 3.7% | |
| 33 | 3607 | 3.6% | |
| 38 | 3558 | 3.5% | |
| 13 | 3407 | 3.4% | |
| 22 | 3280 | 3.2% | |
| 35 | 3144 | 3.1% | |
| 2 | 2875 | 2.8% | |
| 42 | 2859 | 2.8% | |
| 25 | 2495 | 2.5% | |
| 40 | 2362 | 2.3% | |
| 0 | 2271 | 2.2% | |
| 18 | 2249 | 2.2% | |
| 46 | 2101 | 2.1% | |
| 14 | 2089 | 2.1% | |
| 17 | 1914 | 1.9% | |
| 30 | 1835 | 1.8% | |
| 5 | 1766 | 1.7% | |
| 49 | 1679 | 1.7% | |
| 24 | 1629 | 1.6% | |
| 36 | 1573 | 1.6% | |
| 47 | 1552 | 1.5% | |
| Other values (26) | 17808 | 17.6% |
| Value | Count | Frequency (%) | |
| 0 | 2271 | 2.2% | |
| 1 | 208 | 0.2% | |
| 2 | 2875 | 2.8% | |
| 3 | 1421 | 1.4% | |
| 4 | 10151 | 10.1% | |
| 5 | 1766 | 1.7% | |
| 6 | 696 | 0.7% | |
| 7 | 337 | 0.3% | |
| 8 | 200 | 0.2% | |
| 9 | 7738 | 7.7% |
| Value | Count | Frequency (%) | |
| 50 | 392 | 0.4% | |
| 49 | 1679 | 1.7% | |
| 48 | 862 | 0.9% | |
| 47 | 1552 | 1.5% | |
| 46 | 2101 | 2.1% | |
| 45 | 190 | 0.2% | |
| 44 | 735 | 0.7% | |
| 43 | 9231 | 9.1% | |
| 42 | 2859 | 2.8% | |
| 41 | 403 | 0.4% |
RELATED_FACTOR_(2)-PERSON_LEVEL
Real number (ℝ≥0)
| Distinct count | 48 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.935365660407257 |
|---|---|
| Minimum | 0 |
| Maximum | 47 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 29 |
| median | 29 |
| Q3 | 29 |
| 95-th percentile | 29 |
| Maximum | 47 |
| Range | 47 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.006200823 |
|---|---|
| Coefficient of variation (CV) | 0.069333868 |
| Kurtosis | 76.42562404 |
| Mean | 28.93536566 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -3.087831092 |
| Sum | 2921546 |
| Variance | 4.024841743 |
| Value | Count | Frequency (%) | |
| 29 | 99269 | 98.3% | |
| 12 | 419 | 0.4% | |
| 44 | 265 | 0.3% | |
| 18 | 265 | 0.3% | |
| 46 | 256 | 0.3% | |
| 4 | 88 | 0.1% | |
| 20 | 88 | 0.1% | |
| 10 | 60 | 0.1% | |
| 30 | 48 | < 0.1% | |
| 33 | 20 | < 0.1% | |
| 36 | 20 | < 0.1% | |
| 7 | 16 | < 0.1% | |
| 42 | 14 | < 0.1% | |
| 34 | 13 | < 0.1% | |
| 24 | 10 | < 0.1% | |
| 9 | 10 | < 0.1% | |
| 26 | 9 | < 0.1% | |
| 17 | 8 | < 0.1% | |
| 23 | 8 | < 0.1% | |
| 35 | 6 | < 0.1% | |
| 14 | 6 | < 0.1% | |
| 15 | 6 | < 0.1% | |
| 8 | 6 | < 0.1% | |
| 47 | 5 | < 0.1% | |
| 40 | 4 | < 0.1% | |
| Other values (23) | 49 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 2 | < 0.1% | |
| 2 | 3 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 88 | 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 3 | < 0.1% | |
| 7 | 16 | < 0.1% | |
| 8 | 6 | < 0.1% | |
| 9 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 47 | 5 | < 0.1% | |
| 46 | 256 | 0.3% | |
| 45 | 1 | < 0.1% | |
| 44 | 265 | 0.3% | |
| 43 | 3 | < 0.1% | |
| 42 | 14 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 40 | 4 | < 0.1% | |
| 39 | 3 | < 0.1% | |
| 38 | 3 | < 0.1% |
ALCOHOL_TEST_TYPE
Real number (ℝ≥0)
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.839265906029633 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 4 |
| median | 4 |
| Q3 | 9 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.329189455 |
|---|---|
| Coefficient of variation (CV) | 0.398883951 |
| Kurtosis | -1.534252753 |
| Mean | 5.839265906 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.4883436165 |
| Sum | 589579 |
| Variance | 5.425123519 |
| Value | Count | Frequency (%) | |
| 4 | 55405 | 54.9% | |
| 9 | 33550 | 33.2% | |
| 6 | 9810 | 9.7% | |
| 2 | 1447 | 1.4% | |
| 5 | 268 | 0.3% | |
| 7 | 231 | 0.2% | |
| 8 | 144 | 0.1% | |
| 1 | 92 | 0.1% | |
| 3 | 18 | < 0.1% | |
| 0 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3 | < 0.1% | |
| 1 | 92 | 0.1% | |
| 2 | 1447 | 1.4% | |
| 3 | 18 | < 0.1% | |
| 4 | 55405 | 54.9% | |
| 5 | 268 | 0.3% | |
| 6 | 9810 | 9.7% | |
| 7 | 231 | 0.2% | |
| 8 | 144 | 0.1% | |
| 9 | 33550 | 33.2% |
| Value | Count | Frequency (%) | |
| 9 | 33550 | 33.2% | |
| 8 | 144 | 0.1% | |
| 7 | 231 | 0.2% | |
| 6 | 9810 | 9.7% | |
| 5 | 268 | 0.3% | |
| 4 | 55405 | 54.9% | |
| 3 | 18 | < 0.1% | |
| 2 | 1447 | 1.4% | |
| 1 | 92 | 0.1% | |
| 0 | 3 | < 0.1% |
POLICE-REPORTED_DRUG_INVOLVEMENT
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 788.9 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 6282 |
| 0 | 1626 |
| Value | Count | Frequency (%) | |
| 2 | 74725 | 74.0% | |
| 1 | 18335 | 18.2% | |
| 3 | 6282 | 6.2% | |
| 0 | 1626 | 1.6% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 74725 | 74.0% | |
| 1 | 18335 | 18.2% | |
| 3 | 6282 | 6.2% | |
| 0 | 1626 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 100968 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 74725 | 74.0% | |
| 1 | 18335 | 18.2% | |
| 3 | 6282 | 6.2% | |
| 0 | 1626 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 100968 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 74725 | 74.0% | |
| 1 | 18335 | 18.2% | |
| 3 | 6282 | 6.2% | |
| 0 | 1626 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 100968 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 74725 | 74.0% | |
| 1 | 18335 | 18.2% | |
| 3 | 6282 | 6.2% | |
| 0 | 1626 | 1.6% |
POLICE_REPORTED_ALCOHOL_INVOLVEMENT
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 788.9 KiB |
| 1 | |
|---|---|
| 0 | |
| 2 | |
| 3 |
| Value | Count | Frequency (%) | |
| 1 | 45448 | 45.0% | |
| 0 | 34810 | 34.5% | |
| 2 | 10799 | 10.7% | |
| 3 | 9911 | 9.8% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 45448 | 45.0% | |
| 0 | 34810 | 34.5% | |
| 2 | 10799 | 10.7% | |
| 3 | 9911 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 100968 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 45448 | 45.0% | |
| 0 | 34810 | 34.5% | |
| 2 | 10799 | 10.7% | |
| 3 | 9911 | 9.8% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 100968 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 45448 | 45.0% | |
| 0 | 34810 | 34.5% | |
| 2 | 10799 | 10.7% | |
| 3 | 9911 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 100968 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 45448 | 45.0% | |
| 0 | 34810 | 34.5% | |
| 2 | 10799 | 10.7% | |
| 3 | 9911 | 9.8% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2418885191347755 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 16153 |
| Zeros (%) | 16.0% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.539762086 |
|---|---|
| Coefficient of variation (CV) | 0.6868147423 |
| Kurtosis | 0.05216568456 |
| Mean | 2.241888519 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.6995243504 |
| Sum | 226359 |
| Variance | 2.37086728 |
| Value | Count | Frequency (%) | |
| 2 | 64072 | 63.5% | |
| 0 | 16153 | 16.0% | |
| 5 | 15805 | 15.7% | |
| 6 | 1972 | 2.0% | |
| 1 | 1422 | 1.4% | |
| 4 | 1304 | 1.3% | |
| 3 | 240 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 16153 | 16.0% | |
| 1 | 1422 | 1.4% | |
| 2 | 64072 | 63.5% | |
| 3 | 240 | 0.2% | |
| 4 | 1304 | 1.3% | |
| 5 | 15805 | 15.7% | |
| 6 | 1972 | 2.0% |
| Value | Count | Frequency (%) | |
| 6 | 1972 | 2.0% | |
| 5 | 15805 | 15.7% | |
| 4 | 1304 | 1.3% | |
| 3 | 240 | 0.2% | |
| 2 | 64072 | 63.5% | |
| 1 | 1422 | 1.4% | |
| 0 | 16153 | 16.0% |
AGE
Real number (ℝ≥0)
| Distinct count | 99 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.10670707550907 |
|---|---|
| Minimum | 0 |
| Maximum | 99 |
| Zeros | 446 |
| Zeros (%) | 0.4% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 20 |
| median | 32 |
| Q3 | 49 |
| 95-th percentile | 81 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 22.10964127 |
|---|---|
| Coefficient of variation (CV) | 0.5958394859 |
| Kurtosis | 0.2693850882 |
| Mean | 37.10670708 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.8633521931 |
| Sum | 3746590 |
| Variance | 488.836237 |
| Value | Count | Frequency (%) | |
| 18 | 3594 | 3.6% | |
| 19 | 3432 | 3.4% | |
| 20 | 3140 | 3.1% | |
| 21 | 3111 | 3.1% | |
| 17 | 3100 | 3.1% | |
| 22 | 2634 | 2.6% | |
| 99 | 2552 | 2.5% | |
| 16 | 2449 | 2.4% | |
| 23 | 2329 | 2.3% | |
| 24 | 2173 | 2.2% | |
| 25 | 2042 | 2.0% | |
| 26 | 1794 | 1.8% | |
| 28 | 1768 | 1.8% | |
| 27 | 1740 | 1.7% | |
| 30 | 1722 | 1.7% | |
| 29 | 1689 | 1.7% | |
| 37 | 1664 | 1.6% | |
| 39 | 1646 | 1.6% | |
| 31 | 1639 | 1.6% | |
| 38 | 1634 | 1.6% | |
| 40 | 1615 | 1.6% | |
| 35 | 1606 | 1.6% | |
| 41 | 1587 | 1.6% | |
| 36 | 1574 | 1.6% | |
| 42 | 1570 | 1.6% | |
| Other values (74) | 47164 | 46.7% |
| Value | Count | Frequency (%) | |
| 0 | 446 | 0.4% | |
| 1 | 623 | 0.6% | |
| 2 | 633 | 0.6% | |
| 3 | 594 | 0.6% | |
| 4 | 621 | 0.6% | |
| 5 | 534 | 0.5% | |
| 6 | 536 | 0.5% | |
| 7 | 583 | 0.6% | |
| 8 | 548 | 0.5% | |
| 9 | 595 | 0.6% |
| Value | Count | Frequency (%) | |
| 99 | 2552 | 2.5% | |
| 97 | 23 | < 0.1% | |
| 96 | 11 | < 0.1% | |
| 95 | 21 | < 0.1% | |
| 94 | 23 | < 0.1% | |
| 93 | 30 | < 0.1% | |
| 92 | 62 | 0.1% | |
| 91 | 93 | 0.1% | |
| 90 | 92 | 0.1% | |
| 89 | 136 | 0.1% |
SEATING_POSITION
Real number (ℝ≥0)
| Distinct count | 26 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.990700023769907 |
|---|---|
| Minimum | 0 |
| Maximum | 25 |
| Zeros | 7 |
| Zeros (%) | < 0.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 16 |
| Maximum | 25 |
| Range | 25 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 4.794033728 |
|---|---|
| Coefficient of variation (CV) | 0.8002459995 |
| Kurtosis | 3.601701575 |
| Mean | 5.990700024 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.955037901 |
| Sum | 604869 |
| Variance | 22.98275939 |
| Value | Count | Frequency (%) | |
| 3 | 57331 | 56.8% | |
| 6 | 18779 | 18.6% | |
| 8 | 6442 | 6.4% | |
| 16 | 5263 | 5.2% | |
| 13 | 4857 | 4.8% | |
| 14 | 2134 | 2.1% | |
| 25 | 1768 | 1.8% | |
| 9 | 1734 | 1.7% | |
| 4 | 928 | 0.9% | |
| 11 | 341 | 0.3% | |
| 17 | 219 | 0.2% | |
| 19 | 203 | 0.2% | |
| 22 | 198 | 0.2% | |
| 18 | 150 | 0.1% | |
| 20 | 140 | 0.1% | |
| 23 | 105 | 0.1% | |
| 12 | 100 | 0.1% | |
| 15 | 75 | 0.1% | |
| 10 | 68 | 0.1% | |
| 7 | 58 | 0.1% | |
| 5 | 24 | < 0.1% | |
| 24 | 20 | < 0.1% | |
| 21 | 14 | < 0.1% | |
| 0 | 7 | < 0.1% | |
| 2 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 7 | < 0.1% | |
| 1 | 4 | < 0.1% | |
| 2 | 6 | < 0.1% | |
| 3 | 57331 | 56.8% | |
| 4 | 928 | 0.9% | |
| 5 | 24 | < 0.1% | |
| 6 | 18779 | 18.6% | |
| 7 | 58 | 0.1% | |
| 8 | 6442 | 6.4% | |
| 9 | 1734 | 1.7% |
| Value | Count | Frequency (%) | |
| 25 | 1768 | 1.8% | |
| 24 | 20 | < 0.1% | |
| 23 | 105 | 0.1% | |
| 22 | 198 | 0.2% | |
| 21 | 14 | < 0.1% | |
| 20 | 140 | 0.1% | |
| 19 | 203 | 0.2% | |
| 18 | 150 | 0.1% | |
| 17 | 219 | 0.2% | |
| 16 | 5263 | 5.2% |
RESTRAINT_SYSTEM-USE
Real number (ℝ≥0)
| Distinct count | 12 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.400394184296014 |
|---|---|
| Minimum | 0 |
| Maximum | 11 |
| Zeros | 60 |
| Zeros (%) | 0.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.900097461 |
|---|---|
| Coefficient of variation (CV) | 0.2968719436 |
| Kurtosis | 1.432523386 |
| Mean | 6.400394184 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.7345974513 |
| Sum | 646235 |
| Variance | 3.610370362 |
| Value | Count | Frequency (%) | |
| 7 | 41462 | 41.1% | |
| 5 | 40763 | 40.4% | |
| 11 | 9017 | 8.9% | |
| 8 | 2973 | 2.9% | |
| 4 | 2523 | 2.5% | |
| 6 | 1898 | 1.9% | |
| 1 | 1542 | 1.5% | |
| 10 | 470 | 0.5% | |
| 9 | 131 | 0.1% | |
| 2 | 73 | 0.1% | |
| 0 | 60 | 0.1% | |
| 3 | 56 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 60 | 0.1% | |
| 1 | 1542 | 1.5% | |
| 2 | 73 | 0.1% | |
| 3 | 56 | 0.1% | |
| 4 | 2523 | 2.5% | |
| 5 | 40763 | 40.4% | |
| 6 | 1898 | 1.9% | |
| 7 | 41462 | 41.1% | |
| 8 | 2973 | 2.9% | |
| 9 | 131 | 0.1% |
| Value | Count | Frequency (%) | |
| 11 | 9017 | 8.9% | |
| 10 | 470 | 0.5% | |
| 9 | 131 | 0.1% | |
| 8 | 2973 | 2.9% | |
| 7 | 41462 | 41.1% | |
| 6 | 1898 | 1.9% | |
| 5 | 40763 | 40.4% | |
| 4 | 2523 | 2.5% | |
| 3 | 56 | 0.1% | |
| 2 | 73 | 0.1% |
SEX
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 788.9 KiB |
| 1 | |
|---|---|
| 0 | |
| 2 | 1655 |
| Value | Count | Frequency (%) | |
| 1 | 65740 | 65.1% | |
| 0 | 33573 | 33.3% | |
| 2 | 1655 | 1.6% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 65740 | 65.1% | |
| 0 | 33573 | 33.3% | |
| 2 | 1655 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 100968 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 65740 | 65.1% | |
| 0 | 33573 | 33.3% | |
| 2 | 1655 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 100968 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 65740 | 65.1% | |
| 0 | 33573 | 33.3% | |
| 2 | 1655 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 100968 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 65740 | 65.1% | |
| 0 | 33573 | 33.3% | |
| 2 | 1655 | 1.6% |
TAKEN_TO_HOSPITAL
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 788.9 KiB |
| 2 | |
|---|---|
| 0 | |
| 1 | 1914 |
| Value | Count | Frequency (%) | |
| 2 | 52355 | 51.9% | |
| 0 | 46699 | 46.3% | |
| 1 | 1914 | 1.9% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 52355 | 51.9% | |
| 0 | 46699 | 46.3% | |
| 1 | 1914 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 100968 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 52355 | 51.9% | |
| 0 | 46699 | 46.3% | |
| 1 | 1914 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 100968 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 52355 | 51.9% | |
| 0 | 46699 | 46.3% | |
| 1 | 1914 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 100968 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 52355 | 51.9% | |
| 0 | 46699 | 46.3% | |
| 1 | 1914 | 1.9% |
PERSON_TYPE
Real number (ℝ≥0)
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.155831550590286 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 744 |
| Zeros (%) | 0.7% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.551538049 |
|---|---|
| Coefficient of variation (CV) | 0.808515286 |
| Kurtosis | -1.826692742 |
| Mean | 3.155831551 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.3444991946 |
| Sum | 318638 |
| Variance | 6.510346414 |
| Value | Count | Frequency (%) | |
| 1 | 57480 | 56.9% | |
| 6 | 36812 | 36.5% | |
| 7 | 5331 | 5.3% | |
| 0 | 744 | 0.7% | |
| 8 | 234 | 0.2% | |
| 2 | 225 | 0.2% | |
| 5 | 103 | 0.1% | |
| 3 | 34 | < 0.1% | |
| 4 | 3 | < 0.1% | |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 744 | 0.7% | |
| 1 | 57480 | 56.9% | |
| 2 | 225 | 0.2% | |
| 3 | 34 | < 0.1% | |
| 4 | 3 | < 0.1% | |
| 5 | 103 | 0.1% | |
| 6 | 36812 | 36.5% | |
| 7 | 5331 | 5.3% | |
| 8 | 234 | 0.2% | |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9 | 2 | < 0.1% | |
| 8 | 234 | 0.2% | |
| 7 | 5331 | 5.3% | |
| 6 | 36812 | 36.5% | |
| 5 | 103 | 0.1% | |
| 4 | 3 | < 0.1% | |
| 3 | 34 | < 0.1% | |
| 2 | 225 | 0.2% | |
| 1 | 57480 | 56.9% | |
| 0 | 744 | 0.7% |
target
Real number (ℝ≥0)
| Distinct count | 8 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7829213216068456 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 9 |
| Zeros (%) | < 0.1% |
| Memory size | 788.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.853606206 |
|---|---|
| Coefficient of variation (CV) | 0.666064898 |
| Kurtosis | -1.300771467 |
| Mean | 2.782921322 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.4629695938 |
| Sum | 280986 |
| Variance | 3.435855968 |
| Value | Count | Frequency (%) | |
| 1 | 42116 | 41.7% | |
| 4 | 20007 | 19.8% | |
| 2 | 15072 | 14.9% | |
| 5 | 13890 | 13.8% | |
| 6 | 8674 | 8.6% | |
| 7 | 901 | 0.9% | |
| 3 | 299 | 0.3% | |
| 0 | 9 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 9 | < 0.1% | |
| 1 | 42116 | 41.7% | |
| 2 | 15072 | 14.9% | |
| 3 | 299 | 0.3% | |
| 4 | 20007 | 19.8% | |
| 5 | 13890 | 13.8% | |
| 6 | 8674 | 8.6% | |
| 7 | 901 | 0.9% |
| Value | Count | Frequency (%) | |
| 7 | 901 | 0.9% | |
| 6 | 8674 | 8.6% | |
| 5 | 13890 | 13.8% | |
| 4 | 20007 | 19.8% | |
| 3 | 299 | 0.3% | |
| 2 | 15072 | 14.9% | |
| 1 | 42116 | 41.7% | |
| 0 | 9 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| RELATED_FACTOR_(3)-PERSON_LEVEL | METHOD_OF_DRUG_DETERMINATION | METHOD_ALCOHOL_DETERMINATION | DRUG_TEST_RESULTS_(3_of_3) | EJECTION_PATH | EXTRICATION | DRUG_TEST_TYPE_(3_of_3) | CASE_STATE | RELATED_FACTOR_(2)-PERSON_LEVEL | ALCOHOL_TEST_TYPE | POLICE-REPORTED_DRUG_INVOLVEMENT | POLICE_REPORTED_ALCOHOL_INVOLVEMENT | DRUG_TEST_TYPE | AGE | SEATING_POSITION | RESTRAINT_SYSTEM-USE | SEX | TAKEN_TO_HOSPITAL | PERSON_TYPE | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 19 | 3 | 2 | 0 | 9 | 1 | 2 | 0 | 29 | 9 | 3 | 3 | 5 | 34 | 3 | 7 | 1 | 0 | 1 | 1 |
| 1 | 19 | 3 | 2 | 0 | 9 | 1 | 2 | 0 | 29 | 4 | 1 | 0 | 2 | 20 | 3 | 7 | 1 | 0 | 1 | 1 |
| 2 | 19 | 3 | 2 | 0 | 0 | 0 | 2 | 0 | 29 | 4 | 1 | 0 | 2 | 43 | 3 | 5 | 1 | 0 | 1 | 1 |
| 3 | 19 | 3 | 2 | 0 | 0 | 0 | 2 | 0 | 29 | 4 | 2 | 1 | 2 | 38 | 6 | 5 | 0 | 2 | 6 | 2 |
| 4 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 0 | 29 | 9 | 3 | 3 | 5 | 50 | 3 | 5 | 1 | 2 | 1 | 1 |
| 5 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 0 | 29 | 4 | 1 | 0 | 2 | 40 | 3 | 5 | 0 | 2 | 1 | 2 |
| 6 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 0 | 29 | 4 | 1 | 0 | 2 | 50 | 3 | 7 | 1 | 0 | 1 | 4 |
| 7 | 19 | 3 | 2 | 0 | 9 | 1 | 2 | 0 | 29 | 6 | 2 | 1 | 5 | 69 | 6 | 7 | 0 | 0 | 6 | 1 |
| 8 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 0 | 29 | 9 | 3 | 0 | 5 | 94 | 8 | 7 | 1 | 0 | 7 | 1 |
| 9 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 0 | 29 | 9 | 3 | 0 | 1 | 47 | 3 | 5 | 1 | 0 | 1 | 4 |
Last rows
| RELATED_FACTOR_(3)-PERSON_LEVEL | METHOD_OF_DRUG_DETERMINATION | METHOD_ALCOHOL_DETERMINATION | DRUG_TEST_RESULTS_(3_of_3) | EJECTION_PATH | EXTRICATION | DRUG_TEST_TYPE_(3_of_3) | CASE_STATE | RELATED_FACTOR_(2)-PERSON_LEVEL | ALCOHOL_TEST_TYPE | POLICE-REPORTED_DRUG_INVOLVEMENT | POLICE_REPORTED_ALCOHOL_INVOLVEMENT | DRUG_TEST_TYPE | AGE | SEATING_POSITION | RESTRAINT_SYSTEM-USE | SEX | TAKEN_TO_HOSPITAL | PERSON_TYPE | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 100958 | 19 | 3 | 2 | 0 | 9 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 63 | 6 | 7 | 0 | 2 | 6 | 2 |
| 100959 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 50 | 29 | 9 | 2 | 0 | 0 | 79 | 3 | 7 | 1 | 2 | 1 | 1 |
| 100960 | 19 | 3 | 2 | 0 | 9 | 1 | 2 | 50 | 29 | 9 | 2 | 0 | 0 | 79 | 3 | 7 | 0 | 0 | 1 | 1 |
| 100961 | 19 | 3 | 2 | 0 | 7 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 32 | 3 | 7 | 1 | 2 | 1 | 2 |
| 100962 | 19 | 3 | 2 | 0 | 7 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 29 | 6 | 7 | 0 | 0 | 6 | 1 |
| 100963 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 10 | 13 | 5 | 0 | 2 | 6 | 6 |
| 100964 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 9 | 16 | 5 | 0 | 2 | 6 | 6 |
| 100965 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 7 | 14 | 4 | 0 | 2 | 6 | 6 |
| 100966 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 50 | 29 | 4 | 2 | 0 | 2 | 4 | 14 | 4 | 0 | 2 | 6 | 6 |
| 100967 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 50 | 29 | 4 | 2 | 2 | 2 | 61 | 3 | 7 | 1 | 0 | 1 | 1 |
Most frequent
| RELATED_FACTOR_(3)-PERSON_LEVEL | METHOD_OF_DRUG_DETERMINATION | METHOD_ALCOHOL_DETERMINATION | DRUG_TEST_RESULTS_(3_of_3) | EJECTION_PATH | EXTRICATION | DRUG_TEST_TYPE_(3_of_3) | CASE_STATE | RELATED_FACTOR_(2)-PERSON_LEVEL | ALCOHOL_TEST_TYPE | POLICE-REPORTED_DRUG_INVOLVEMENT | POLICE_REPORTED_ALCOHOL_INVOLVEMENT | DRUG_TEST_TYPE | AGE | SEATING_POSITION | RESTRAINT_SYSTEM-USE | SEX | TAKEN_TO_HOSPITAL | PERSON_TYPE | target | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7453 | 19 | 3 | 2 | 999 | 9 | 2 | 5 | 38 | 29 | 6 | 3 | 2 | 5 | 99 | 25 | 11 | 2 | 1 | 1 | 7 | 94 |
| 3092 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 14 | 29 | 4 | 2 | 1 | 2 | 99 | 25 | 11 | 2 | 0 | 6 | 4 | 91 |
| 1616 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 4 | 29 | 4 | 3 | 2 | 2 | 99 | 3 | 11 | 2 | 0 | 1 | 7 | 79 |
| 7876 | 30 | 3 | 2 | 999 | 9 | 2 | 5 | 38 | 44 | 6 | 3 | 2 | 5 | 99 | 25 | 11 | 2 | 1 | 6 | 7 | 68 |
| 622 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 0 | 29 | 4 | 2 | 1 | 2 | 99 | 6 | 5 | 2 | 0 | 6 | 4 | 67 |
| 3893 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 30 | 29 | 4 | 2 | 1 | 2 | 99 | 9 | 7 | 2 | 2 | 6 | 3 | 53 |
| 5914 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 43 | 29 | 4 | 2 | 1 | 2 | 99 | 3 | 11 | 2 | 1 | 1 | 7 | 47 |
| 1615 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 4 | 29 | 4 | 3 | 2 | 2 | 99 | 3 | 11 | 2 | 0 | 1 | 4 | 45 |
| 3016 | 19 | 3 | 2 | 0 | 0 | 1 | 2 | 13 | 29 | 4 | 2 | 2 | 2 | 99 | 3 | 11 | 2 | 0 | 1 | 4 | 43 |
| 7452 | 19 | 3 | 2 | 999 | 9 | 2 | 5 | 38 | 29 | 6 | 3 | 2 | 5 | 99 | 25 | 11 | 2 | 1 | 1 | 1 | 36 |